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GENETIC LOCI INDICATIVE OF PROPENSITY FOR LONGEVITY AND METHODS 
FOR IDENTIFYING PROPENSITY FOR AGE-RELATED DISEASE 



Related Applications 

[0001] This application claims the benefi/ of U.S. Application Nos. 60/224,263 and 
60/249,921, the disclosures of each of which/is incorporated by reference herein. 

Government Support 

[0002] This invention was made with Government support under Grant No. AG1 8721 -2 
awarded by the National Institutes of Health. The Government may have certain rights in this 
invention. 

Field of the Invention 

[0003] This invention relates to polymorphic markers indicative of the propensity for 
longevity and methods for using such markers, and more particularly to genetic loci on a 
specified region of human chromosome 4 associated with exceptional longevity. 

Background of the Invention 

[0004] It has long been suspected that a genetic predisposition exists for longevity in 
humans. For example, many studies have documented the inheritance of human longevity. 
McGue et al 9 J. Gerontol. Biol. Sci., 48: B237-44 (1993); Ljungquist et al, J. Gerontol Biol 
Set, 53: M441-46 (1998). Studies have also documented that centenarians (individuals who live 
for 100 years or more) are more likely than non-centenarians to have siblings who are long-lived. 
In particular, one study has shown that the siblings of centenarians have an approximately four- 
fold greater probability of survival to age 91 than siblings of non-centenarians. Perls et al 9 
Lancet, 351 : 560 (1998). A more recent study has shown that female siblings of centenarians 
have over an eight- fold greater probability of survival to age 100 than siblings of non- 
centenarians and male siblings of centenarians have over an eleven- fold greater probability of 
survival to age 97 than siblings of non-centenarians. 
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[0005] Individuals who achieve exceptional longevity, such as centenarians, tend to live the 
majority of their lives in excellent health, demonstrating a rapid decline only at the end of their 
lives. Hitt et aL, Lancet, 354: 652 (1999). A relative lack of polymorphic variants associated 
with diseases of aging may be one prerequisite to achieving exceptional longevity. For 
example, the absence of genetic polymorphisms among centenarians is exemplified by the rarity 
of the apolipoprotein E e-4 allele that has been associated with Alzheimer's disease and 
cardiovascular disease. Schachter et aL, Nat. Genet, 6: 29-32 (1994); Rebeck et aL, Neurology, 
44: 1513-16 (1994). Another prerequisite to achieving exceptional longevity may be the ability 
to modulate the rate of the aging process, which also appears to have a genetic component. For 
example, one study has shown that the offspring of centenarians had more favorable lipid profile 
characteristics compared to ethnically matched and unmatched controls. Barzilai et aL, J, Am. 

Q Geriatr. Soc., 49: 76-79 (2001). These and other observations indicate that there may be one or 

. ^ more genetic loci that influence longevity. 

ry 

CO [0006] Genetic studies in other species including mammals indicate that specific genetic 

;I polymorphisms have powerful influences upon life span (defined by the age of the oldest 

Id 

fU member of the species). A number of studies on non-human species indicate that a relatively 

p few genetic polymorphisms have a powerful influence upon the ability to achieve exceptional 

f9 longevity. Many of those polymorphisms appear to play roles in basic mechanisms of 

p metabolism and aging. 

O 

i& [0007] Only recently have researchers begun to investigate a genetic link to exceptional 
longevity. One approach to determining the significance of genes having an influence on 
longevity is to screen for polymorphisms in human homologs and to determine the allelic 
frequencies among specific human phenotypes, such as centenarians, and compare them to 
ethnically matched younger controls. Alternatively, case-control studies can be performed on a 
priori-selected candidate genes chosen because of their hypothesized roles in fundamental 
mechanisms of aging. These basic mechanisms might include modulation of genomic instability, 
for example by DNA repair and anti-oxidant defenses, gene expression, cell proliferation and 
senescence, maintenance of differential function, and signal transduction. The drawbacks of 
such studies include the improbability of picking the right gene to study out of the myriad known 
and unknown genes effecting the process of interest. 



[0008] Knowledge of longevity-associated polymorphisms would not only provide the 
benefit of predicting individual longevity, but would also provide the ability to predict the 
likelihood of age-associated diseases. Making such predictions would allow early prophylaxis 
that would reduce the severity of or would eliminate the occurrence of such diseases. 
Accordingly, there is a need in the art for genetic loci associated with longevity and methods of 
using polymorphic markers indicative of the propensity for longevity. Such genetic loci and 
methods are provided herein. 

Summary of the Invention 

[0009] This invention provides a genetic locus associated with extreme longevity. 
According to the invention, a specified region of human chromosome 4 is linked to the 
propensity for old age. In particular, the invention provides a longevity locus with a linkage at 
the D4S1564 marker (also referred to as AFM248zg9, 248zg9, and Z23817) on human 
chromosome 4. According to the invention, the presence of a familial linkage to the D4S1564 
marker on human chromosome 4 is indicative of a polymorphic variant associated with increased 
likelihood for longevity. Detection of an inherited variant at the D4S1564 marker, or a 
polymorphism within the longevity locus containing the D4S1564 marker, is indicative of the 
propensity for extreme old age. 

[0010] In a preferred embodiment, the invention provides a polymorphic marker contained in 
an approximately 10-20 cM region surrounding the D4S1564 marker. In a further embodiment, 
the invention provides a marker within 20 cM of position 1 12.6 on human chromosome 4. Also 
in a preferred embodiment, the invention comprises methods for detecting a longevity marker in 
a biological sample. Preferably, such methods comprise amplifying DNA in the region of human 
chromosome 4 comprising the D4S1564 marker. The amplification product comprises a region 
of human chromosome 4 that contains the locus associated with longevity. The common 
presence of a variant of the D4S1564 marker among related individuals, at least one of whom 
has lived to old age, is evidence of the presence of a polymorphic variant associated with the 
propensity for old age. 
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[0011] Inherited variants of the D4S1564 marker in association with extreme longevity is 
indicative of the propensity for old age and the likelihood of avoiding disease of old age. For 
example, the invention provides the ability to predict the propensity for diseases such as heart 
disease, cardiovascular disease, stroke, Alzheimer's disease, cancer, ocular disease, and 
numerous others associated with the aging process. Accordingly, methods of the invention are 
useful not only to predict the likelihood of longevity, but are also useful to indicate possible early 
therapeutic intervention to prevent or to lessen the effects of diseases associated with aging. 
Further details of the use of the invention are provided below in the detailed description thereof 

Description of the Drawings 

[0012] The invention is pointed out with particularity in the appended claims. The 
advantages of the invention described above, as well as further advantages of the invention, may 
be better understood by reference to the following detailed description taken in conjunction with 
CO the accompanying drawings, in which: 

Lea 

[0013] Fig. 1 shows multipoint LOD scores for a genome scan on human chromosome 1. 

[0014] Fig. 2 shows multipoint LOD scores for a genome scan on human chromosome 2. 

[0015] Fig. 3 shows multipoint LOD scores for a genome scan on human chromosome 3. 

[0016] Fig. 4 shows multipoint LOD scores for a genome scan on human chromosome 4. 

[0017] Fig. 5 shows multipoint LOD scores for a genome scan on human chromosome 5. 

[0018] Fig. 6 shows multipoint LOD scores for a genome scan on human chromosome 6. 

[0019] Fig. 7 shows multipoint LOD scores for a genome scan on human chromosome 7. 

[0020] Fig. 8 shows multipoint LOD scores for a genome scan on human chromosome 8. 

[0021] Fig. 9 shows multipoint LOD scores for a genome scan on human chromosome 9. 

[0022] Fig. 10 shows multipoint LOD scores for a genome scan on human chromosome 10. 

[0023] Fig. 1 1 shows multipoint LOD scores for a genome scan on human chromosome 1 1 . 

[0024] Fig. 12 shows multipoint LOD scores for a genome scan on human chromosome 12. 
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[0025] Fig. 13 shows multipoint LOD scores for a genome scan on human chromosome 13. 

[0026] Fig. 14 shows multipoint LOD scores for a genome scan on human chromosome 14. 

[0027] Fig. 15 shows multipoint LOD scores for a genome scan on human chromosome 15. 

[0028] Fig. 16 shows multipoint LOD scores for a genome scan on human chromosome 16. 

[0029] Fig. 17 shows multipoint LOD scores for a genome scan on human chromosome 17. 

[0030] Fig. 18 shows multipoint LOD scores for a genome scan on human chromosome 18. 

[0031] Fig. 19 shows multipoint LOD scores for a genome scan on human chromosome 19. 

[0032] Fig. 20 shows multipoint LOD scores for a genome scan on human chromosome 20. 

[0033] Fig. 21 shows multipoint LOD scores for a genome scan on human chromosome 21. 

[0034] Fig. 22 shows multipoint LOD scores for a genome scan on human chromosome 22. 

jig [0035] Fig. 23 shows multipoint LOD scores for a genome scan on human chromosome 23. 

|I 

p [0036] Fig. 24 shows Kaplan-Meier curves depicting the probability of surviving to each age 
~ (conditional on surviving to at least age 20) of male and female siblings (n=2,092) of 
C3 centenarians (n=444) to males and females from the same birth cohort (data from the U.S. Social 
Security Administration's 1900 cohort life table). 
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Detailed Description of the Invention 

[0037] Compositions and methods of the invention are useful for predicting the propensity 
for exceptional longevity in humans. Additionally, compositions and methods of the invention 
are useful for predicting the propensity for age-related diseases including, but not limited to heart 
disease, cardiovascular disease, stroke, Alzheimer's disease, cancer, and ocular disease. 
Additionally, compositions and methods of the invention are useful for indicating possible early 
therapeutic intervention to prevent or to lessen the effects of diseases associated with aging. 

[0038] The present invention provides a polymorphic locus with significant influence upon 
longevity in humans. In particular, the invention provides a longevity locus with a linkage to the 
D4S1564 marker on human chromosome 4. The D4S1564 marker is also referred to as 
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AFM248zg9, 248zg9, and Z23817. The D4S1564 marker is a DNA segment that is 
approximately 33 1 base pairs in length and contains a C-A repeat. Gyapay et ah, Nat Genet., 
7(2): 246-339 (1994), which is incorporated by reference herein. The D4S1564 marker has 12 
alleles with an allele size range of 220-242 base pairs and is located at approximately the 1 12.6 
cM position on human chromosome 4. The D4S1564 marker is available on numerous Internet 
databases (see, e.g., Genbank, http://www.ncbi.nlm.nih.gov/; Genethon, 
http://www.genethon.fr/eng/indeng.html; Stanford Genome Center, http://www- 
shgc.stanford.edu/; Whitehead Institute Genome Center, http://www-genome.wi.mit.edu/; each 
of which is incorporated by reference herein). 

[0039] The following examples provide further details of the polymorphic markers and 
methods according to the invention. Accordingly, while exemplified in the following manner, 
the invention is not so limited and the skilled artisan will appreciate its wide range of application 
upon consideration thereof. 

EXAMPLE 1 

Linkage to Exceptional Longevity on Human Chromosome 4 at Marker D4S1564 

[0040] To investigate genetic markers that confer a substantial survival advantage in humans, 
genome- wide scans were performed on 308 individuals belonging to 137 sibships in which at 
least one sibling was 98 years old and at least one other sibling was at least 95 years old (female) 
or 91 years old (male). The genome- wide scans were performed using a linkage mapping set 
obtained from Applied Biosystems (Foster City, CA). DNA samples from each individual were 
initially genotyped with 400 tandem repeat markers with an average heterozygosity of 0.70 and 
an average marker density of 10 centiMorgans (cM). Size standards and alleles were determined 
using an allele calling program. 

[0041] Loci suggestive of linkage were genotyped with additional markers. Sibships in 
which the observed genotype data was 10 5 -fold more likely to have arisen if the individuals were 
unrelated or half-siblings were excluded from the genetic analysis. In addition, those families 
with aberrantly high numbers of Mendelian inheritance errors were excluded. In suggestively- 



linked regions with higher density mapping, genotypes that implied double-recombinants were 
identified and the raw data were reexamined or retyped. 

[0042] An affecteds-only multipoint non-parametric analysis was performed. This analysis 
tested for excess sharing of chromosomal regions that were identical by descent in the sibships. 
The non-parametric analysis (MLS) was performed using the GENEHUNTER 2.0 program 
(Falling Rain Genomics, Inc., Lincoln, MA). 

[0043] Figs. 1-23 show multipoint LOD scores for a genome-wide scan of 308 individuals 
belonging to 137 families. The multipoint LOD scores in Figs. 1-23 are plotted as a function of 
specific markers on each chromosome. The horizontal threshold lines on each graph represent a 
MLS score of 2.0, a score slightly higher than the average maximum score expected by chance 
once in a genome scan. 

[0044] The genome screen of the 308 individuals identified a region on chromosome 4 as 
highly suggestive of linkage (MLS=2.67), as shown in Fig. 4. Fine mapping at an average of 1 
marker every 3 cM was performed in a 20 cM region around this peak resulting in an increased 
MLS=3.65 at marker D4S1564, as shown in Table 1 below. A drop off of 1.5 in the MLS score 
on either side of the peak MLS defines with 95% confidence the area in which the gene resides. 
A drop off in the MLS of 2 on either side of the peak is observed in a 20 cM region encompassed 
by D4S414 and D4S161 1. Table 1 below shows the MLS scores in this region and the HLod 
scores under the dominant model : 



TABLE 1 



Marker 


Position 


MLS 


Hloddom 


D4S1534 


95.0 


0.57 


0.54 


D4S414 


100.8 


1.53 


1.30 


D4S2986 


105.3 


2.78. 


2.26 


D4S1572 


108.0 


3.07 


2.57 


D4S411 


109.0 


3.07 


2.60 


D4S1564 


112.6 


3.65 


3.26 


D4S406 


117.1 


2.55 


2.15 


D4S1611 


121.6 


1.70 


1.56 


D4S402 


124.5 


1.39 


1.06 


D4S2975 


126.7 


0.94 


0.89 
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[0045] In order to estimate the significance of the observation of linkage to chromosome 4, 
simulations under the hypothesis of no linkage were designed which match the family structure 
and marker heterozygosity as described above. One thousand genome scans were simulated with 
a genome- wide map density of 1 marker every three 3 cM (matching the fine mapping density 
performed in the region of maximum linkage) and in only 44 of those 1000 genome scans was 
the observed MLS of 3.65 exceeded (p=0.044). 

[0046] Previous studies indicate that parametric linkage analyses using both a dominant and 
recessive inheritance model may be more powerful in some cases than non-parametric studies for 
detecting genes contributing to complex traits. In addition, non-parametric methods do not 
easily allow for the presence of locus heterogeneity. Therefore, multipoint genome-wide 
affecteds-only parametric linkage analyses were conducted under both a dominant and recessive 

- % Q model with reduced-penetrance and allowing for heterogeneity. 

. p% 

f U [0047] The finely mapped region of chromosome 4 provided the strongest results, with a 
maximum HLOD of 3.3 under the dominant model (with 45% of families linked) and a 

lf t maximum HLOD of 2.9 under the recessive model (with 57% of families linked), both occurring 

sy 

s at marker D4S 1 564. Empiric p-values were determined by genome-wide simulations under the 

P 

t p dominant model. Of 1000 replicates, 30 had a LOD score greater than 3.3 under the dominant 

H model (p=0.03) and 78 had a LOD score greater than 2.9 under the recessive model (p=0.08). A 

O 

q LOD score of 3.0 or higher is indicative of linkage. Because two models were evaluated, these 
f ^ parametric results are appropriately summarized and corrected by using the more significant 
result by multiplying the associated p-value by 2, thus producing a final parametric p-value of 
0.06. For this dataset and trait, the parametric analyses provide additional support for the 
findings of the non-parametric analyses, but do not provide greater statistical significance. 

[0048] These analyses provide significant evidence for heterogeneity, with odds in favor of 
heterogeneity of 1800:1. The significant linkage based upon a sibling-pair study of modest 
sample size indicates that there is at least one gene of significant importance in conferring 
exceptional longevity. 
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EXAMPLE 2 



Propensity for Siblings of Centenarians of Achieving Exceptional Longevity 



[0049] To investigate the propensity of siblings achieving exceptional longevity, pedigree 
data from the families of centenarians was compiled and analyzed. The pedigrees of 444 
centenarian subjects revealed 2,092 siblings. Ages of death or for those still alive, ages when 
last censored, were tabulated. If there were multiple centenarians in a family, the oldest living 
centenarian was considered the proband. There were 356 female probands (80% of the 
centenarian subjects) with a median age of 106 years with a 95% confidence interval (CI) from 
105 to 107 years. There were 88 male probands (20% of the centenarian subjects) with a median 
age of 107 years with a 95% confidence interval (CI) from 104 to 108 years. The probands had a 
O median of 5 siblings with a range of 1-15 siblings. 

^ [0050] The survival experience of the siblings of centenarians, segregated by gender, was 



CO estimated according the Kaplan-Meier product-limit method. Kaplan et aL, J. Am. Statist. 

Assoc., 53: 457-481 (1958). Standard errors of the survival probabilities were calculated using 



iU Greenwood's formula. Kalbfleisch et al. 9 Statistical Analysis of Failure Time Data (1980). 

£ 

p Standard errors were calculated assuming binomial probabilities at each age interval and using 

fn 

£J the number of centenarian siblings at risk of surviving each age interval as the effective sample 

Q size for that interval. The estimated median survival of the 1,075 female siblings (51% of the 

Q 

^ sample) was 86 years with a 95% confidence interval (CI) from 85 to 88 years. The estimated 
median survival of the 1,017 male siblings (49% of the sample) was 78 years with a 95% 
confidence interval (CI) from 76 to 80 years. Fig. 24 shows Kaplan-Meier curves for both 
gender groups among the individuals who achieved at least the age of twenty years. 

[0051] The propensity (A, s ) of siblings surviving to specific ages was estimated for 
centenarian siblings relative to the U.S. Social Security Administration's cohort life table of 
100,000 individuals from the year. The sibling propensity (k s ) was defined as the probability 
that a centenarian sibling survived to age x given that the sibling survived to age 20, divided by 
the probability that an individual born in 1900 survived to age x, given that the individual 
survived to age 20, where x is greater than 21 years of age. Pointwise 95% confidence intervals 
(CI) for sibling propenisty (X s ) was calculated according to the natural log scale and 
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exponentiated. Data analysis was performed using the SAS v8.0 program (SAS Institute Inc., 
Cary, NC). 

[0052] Table 2 below compares the two groups for those individuals having achieved at least 
the age of 20 years, wherein the sibling propensity (A, s ) according to age and gender is shown. 
The sibling propensity (A, s ) determination was limited to age 97 for males and age 100 for 
females due to sample size. Beyond these ages, the lower 95% confidence interval (CI) dropped 
below 1. 



Table 2 



Survival 
to Age 
(yrs) 


i e Male 


Lower 
95% CI 


Uooer 
95% CI 


Female 


Lower 
95% CI 


UoDer 
95% CI 


21 


1.00 


0.95 


1.07 


1.00 


0.95 


1.06 


30 


1.01 


0.95 


1.07 


1.02 


0.96 


1.08 


35 


1.01 


0.95 


1.08 


1.03 


0.97 


1.09 


40 


1.03 


0.96 


1.10 


1.04 


0.98 


1.11 


45 


1.04 


0.97 


1.12 


1.06 


0.99 


1.13 


50 


1.08 


1.00 


1.16 


1.07 


1.01 


1.15 


55 


1.11 


1.02 


1.20 


1.09 


1.02 


1.17 


60 


1.18 


1.09 


1.28 


1.12 


1.04 


1.20 


65 


1.29 


1.17 


1.42 


1.16 


1.08 


1.25 


70 


1.48 


1.32 


1.65 


1.24 


1.14 


1.35 


75 


1.69 


1.46 


1.95 


1.36 


1.24 


1.49 


80 


2.03 


1.67 


2.47 


1.54 


1.37 


1.72 


85 


2.70 


2.00 


3.65 


1.84 


1.58 


2.14 


90 


4.09 


2.31 


7.23 


2.57 


2.03 


3.25 


91 


4.55 


2.33 


8.90 


2.77 


2.13 


3.61 


92 


5.36 


2.40 


11.97 


3.09 


2.28 


4.18 


93 


5.36 


2.40 


11.97 


3.45 


2.44 


4.89 


94 


5.36 


2.40 


11.97 


3.74 


2.48 


5.65 


95 


8.37 


1.92 


36.37 


4.28 


2.59 


7.09 


96 


9.67 


1.51 


61.82 


4.76 


2.56 


8.88 


97 


11.31 


1.02 


125.4 


5.47 


2.48 


12.09 


98 








6.77 


2.42 


18.94 


99 








8.01 


2.11 


30.33 


100 








8.64 


1.32 


56.48 



[0053] The survival curves remained approximately the same for both groups through young 
adulthood (k s ~ 1). As shown in both Fig. 24 and Table 2, the survival experiences of the 
siblings of centenarians compared to the survival of the 1900 birth cohort began to diverge at 
ages 40 to 50 and became increasingly divergent at age 90 and beyond. For example, the sibling 
propensity (X s ) for men by age 97 was 1 1 .3 and the sibling propensity (k s ) for women by age 100 
was 8.6. Thus, compared to the general population, the siblings of centenarians appear to have a 
markedly increased propensity for achieving exceptional longevity, suggesting a significant 
genetic contribution. 

[0054] While the invention has been shown and described with reference to specific 
preferred embodiments, it should be understood by those skilled in the art that various changes in 
form and detail may be made therein without departing from the spirit and scope of the invention 
as defined by the following claims. 



